Abstract

We consider two problems of constructing goodness of fit tests for
ergodic diffusion processes. The first one concerns a composite basic
hypothesis for a parametric class of diffusion processes, which
includes the Ornstein-Uhlenbeck and simple switching processes. In
this case we propose asymptotically parameter free tests of
Cramér-von Mises type. The basic hypothesis in the second problem is
simple and we propose asymptotically distribution free tests for a
wider class of trend coefficients.

1 Introduction

In this paper we consider two different goodness of fit (GoF) hypotheses
testing problems for the diffusion process

$$ dX_t = S(X_t)\,dt + \sigma(X_t)\,dW_t, \qquad X_0, \quad 0 \le t \le T. $$

In the first problem the observed process under the basic hypothesis
($\mathcal H_0$) satisfies the stochastic differential equation

$$ dX_t = -\beta\,\mathrm{sgn}(X_t-\alpha)\,|X_t-\alpha|^{\gamma}\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

where $\vartheta = (\alpha,\beta) \in \Theta$ is the unknown parameter,
$\beta > 0$, $\gamma \ge 0$ and $\sigma > 0$. Therefore the hypothesis
is parametric composite.

In the second problem we assume that under the basic hypothesis
($\mathcal H_0$) the observed process satisfies

$$ dX_t = S_0(X_t)\,dt + \sigma(X_t)\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

where $S_0(\cdot)$ is a known function, i.e., ($\mathcal H_0$) is simple.

In both models the alternatives are nonparametric and, under the
hypothesis $\mathcal H_0$, the diffusion processes are assumed to be
ergodic with the invariant densities $f(\vartheta,x)$ and $f_{S_0}(x)$
respectively. We denote the corresponding distribution functions by
$F(\vartheta,x)$ and $F_{S_0}(x)$.

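Our goal is to construct goodness of fit tests which have a fixed
limit error $\varepsilon \in (0,1)$. Introduce the class
$\mathcal K_\varepsilon$ of such tests, i.e., the tests $\psi_T$
satisfying the relations

$$ \lim_{T\to\infty} \mathbf E_\vartheta\,\psi_T = \varepsilon \quad \text{for all } \vartheta \in \Theta, $$

and

$$ \lim_{T\to\infty} \mathbf E_{S_0}\,\psi_T = \varepsilon $$

in the first and the second problems respectively.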

All tests studied in the present work are of the form
$\psi_T = \mathbb 1_{\{\Delta_T > c_\varepsilon\}}$, where $\Delta_T$
is a Cramér-von Mises type statistic. More precisely, in the first
problem $\Delta_T$ is either of the $L_2$ distances
$D(\hat F_T(x), F(\hat\vartheta_T,x))$ and
$D(\hat f_T(x), f(\hat\vartheta_T,x))$, where $\hat F_T(x)$ is the
empirical distribution function, $\hat f_T(x)$ is the local time
estimator of the invariant density and $\hat\vartheta_T$ is the
maximum likelihood estimator (MLE) of the parameter $\vartheta$.
Similarly, in the second problem $\Delta_T$ is one of the distances
$D(\hat F_T(x), F_{S_0}(x))$ and $D(\hat f_T(x), f_{S_0}(x))$.

Let us denote by $\Delta(\vartheta)$ and $\Delta(S_0)$ the limits (in
distribution) of the test statistics in the first and the second
problems. Then the thresholds $c_\varepsilon$ in these tests have to
satisfy the equations

$$ \mathbf P_\vartheta\{\Delta(\vartheta) > c_\varepsilon\} = \varepsilon, \qquad \mathbf P_{S_0}\{\Delta(S_0) > c_\varepsilon\} = \varepsilon. \tag{1} $$

The main contribution of this work is the following. We introduce
modifications of the statistics $\Delta_T$ such that their limit
distributions do not depend on $\vartheta$ in the first problem and do
not depend on $S_0(\cdot)$ in the second problem. Therefore the
corresponding tests are asymptotically parameter free in the first
case and asymptotically distribution free in the second case. These
modifications essentially simplify the solution of the equations (1)
and allow one to choose the thresholds $c_\varepsilon$ before actually
conducting the experiments.

Let us briefly recall what happens in the analogous problems in the
case of independent identically distributed observations
$X^n = (X_1,\ldots,X_n)$. In the problem of the first type we have the
following results. Suppose that the basic hypothesis is

$$ \mathcal H_0:\quad X_j \sim F_0(\vartheta,x), \qquad \vartheta \in \Theta, $$

where $F_0(\cdot,x)$ is some known distribution function. The limit
distribution of the Cramér-von Mises statistic (under hypothesis
$\mathcal H_0$)

$$ \Delta_n(X^n) = n\int_{-\infty}^{\infty} \bigl[\hat F_n(x) - F_0(\hat\vartheta_n,x)\bigr]^2\,dF_0(\hat\vartheta_n,x) \Longrightarrow \Delta(\vartheta) $$

depends on $\vartheta$. Here $\hat F_n(x)$ is the empirical distribution
function and $\hat\vartheta_n$ is some estimator. The choice of the
threshold $c_\varepsilon$ for the GoF test

$$ \psi_n = \mathbb 1_{\{\Delta_n(X^n) > c_\varepsilon\}} $$

can be a difficult problem, since $c_\varepsilon = c_\varepsilon(\vartheta)$
is a solution of the equation

$$ \mathbf P_\vartheta\{\Delta(\vartheta) > c_\varepsilon\} = \varepsilon. $$

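It is well known that for some distributions, say with shift and scale
parameters like $F_0\bigl(\frac{x-\alpha}{\beta}\bigr)$, the limit
distribution does not depend on the parameter, i.e., the statistic is
asymptotically parameter free (APF). For example, if the hypothesis is

$$ \mathcal H_0:\quad X_j \sim \mathcal N(\alpha,\beta^2), \qquad \vartheta = (\alpha,\beta) \in \Theta, $$

then the limit distribution of the Cramér-von Mises statistic

$$ \Delta_n(X^n) = n\int_{-\infty}^{\infty} \Bigl[\hat F_n(x) - F_0\Bigl(\frac{x-\hat\alpha_n}{\hat\beta_n}\Bigr)\Bigr]^2\,dF_0\Bigl(\frac{x-\hat\alpha_n}{\hat\beta_n}\Bigr) $$

does not depend on $\vartheta$ (see, e.g., [10], [5], [13]).

Here $F_0(x)$ is the distribution function of an $\mathcal N(0,1)$
random variable. Therefore the threshold $c_\varepsilon$ does not
depend on $\vartheta$ and the test can be easily constructed. A similar
statement for the Pareto distribution was studied by Choulakian and
Stephens [1] and another class of distributions was treated by
Martynov [14].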
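The parameter-free property in the normal case can be illustrated
numerically. The sketch below is only an illustration: the function
name `cvm_normal_estimated`, the sample size and the shift and scale
values are our own choices and are not taken from the cited works. It
plugs the Gaussian MLE into the statistic and uses the classical
computing formula for $n\int(\hat F_n - F)^2\,dF$. Because the
estimators of shift and scale are equivariant, the statistic takes the
same value on a sample and on any affine image of it.

```python
import math
import numpy as np

def cvm_normal_estimated(x):
    """Cramér-von Mises statistic n * int (F_n - F_0((x - a)/b))^2 dF_0
    for the normal family, with (a, b) replaced by the Gaussian MLE."""
    x = np.sort(np.asarray(x, dtype=float))
    n = x.size
    a_hat = x.mean()                 # MLE of the shift
    b_hat = x.std()                  # MLE of the scale (ddof=0)
    z = (x - a_hat) / b_hat
    # standard normal distribution function via the error function
    u = np.array([0.5 * (1.0 + math.erf(v / math.sqrt(2.0))) for v in z])
    i = np.arange(1, n + 1)
    # classical computing formula for n * int (F_n - F)^2 dF
    return 1.0 / (12.0 * n) + np.sum((u - (2.0 * i - 1.0) / (2.0 * n)) ** 2)

rng = np.random.default_rng(0)
z = rng.standard_normal(200)
stat_standard = cvm_normal_estimated(z)              # alpha = 0, beta = 1
stat_shifted = cvm_normal_estimated(5.0 + 3.0 * z)   # alpha = 5, beta = 3
```

The two values agree up to rounding, so the finite-sample law of the
statistic, and hence its limit, is the same for every
$(\alpha,\beta)$.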

The general case of ergodic diffusion processes with a one-dimensional
shift parameter was studied by Negri and Zhou [16]. They showed that
the limit distribution of the Cramér-von Mises statistic does not
depend on the unknown parameter.

2 Preliminaries

We need some properties of the estimators $\hat F_T(x)$ and
$\hat f_T(x)$, which we recall below. We assume that the trend $S(x)$
and the diffusion $\sigma(x)^2$ coefficients of the observed diffusion
process

$$ dX_t = S(X_t)\,dt + \sigma(X_t)\,dW_t, \qquad X_0, \quad 0 \le t \le T $$

satisfy the following conditions.

$\mathcal{ES}$. The function $S(\cdot)$ is locally bounded, the function
$\sigma(\cdot)^2 > 0$ is continuous and for some $C > 0$ the condition

$$ x\,S(x) + \sigma(x)^2 \le C\,(1 + x^2) $$

holds.

Under this condition the stochastic differential equation has a unique 
weak solution (see, e.g., [Hj). 
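Let us denote

$$ V(x) = \int_0^x \exp\Bigl\{-2\int_0^y \frac{S(z)}{\sigma(z)^2}\,dz\Bigr\}\,dy $$

and

$$ G(S) = \int_{-\infty}^{\infty} \sigma(x)^{-2} \exp\Bigl\{2\int_0^x \frac{S(y)}{\sigma(y)^2}\,dy\Bigr\}\,dx. $$

The next condition is:

$\mathcal{RP}$. The functions $S(\cdot)$ and $\sigma(\cdot)^2$ are such that

$$ G(S) < \infty, \qquad V(x) \to \pm\infty \quad \text{as } x \to \pm\infty. $$

Under this condition the diffusion process is ergodic, i.e., positive
recurrent with the invariant density

$$ f(x) = \frac{1}{G(S)\,\sigma(x)^2} \exp\Bigl\{2\int_0^x \frac{S(y)}{\sigma(y)^2}\,dy\Bigr\}. $$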

The empirical distribution function $\hat F_T(x)$ and the local time
estimator $\hat f_T(x)$ of the invariant density $f(x)$ are

$$ \hat F_T(x) = \frac1T \int_0^T \mathbb 1_{\{X_t < x\}}\,dt, \qquad \hat f_T(x) = \frac{\Lambda_T(x)}{T\,\sigma(x)^2}, $$

where the local time $\Lambda_T(x)$ satisfies the equation
(Tanaka-Meyer formula)

$$ \Lambda_T(x) = |X_T - x| - |X_0 - x| - \int_0^T \mathrm{sgn}(X_t - x)\,dX_t. $$
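The empirical distribution function is straightforward to evaluate on
a discretized trajectory. The following sketch is an illustration
under our own assumptions: an Euler scheme for an Ornstein-Uhlenbeck
process with $\beta = \sigma = 1$, $\alpha = 0$, and the hypothetical
helper names `simulate_ou` and `empirical_df`. The time integral is
replaced by a Riemann sum over the grid.

```python
import math
import numpy as np

def simulate_ou(beta=1.0, alpha=0.0, sigma=1.0, T=2000.0, dt=0.01, seed=0):
    """Euler scheme for dX = -beta (X - alpha) dt + sigma dW on [0, T]."""
    rng = np.random.default_rng(seed)
    n = int(round(T / dt))
    dw = rng.normal(0.0, math.sqrt(dt), size=n)
    x = np.empty(n + 1)
    x[0] = alpha
    for k in range(n):
        x[k + 1] = x[k] - beta * (x[k] - alpha) * dt + sigma * dw[k]
    return x

def empirical_df(path, x):
    """Riemann-sum version of (1/T) * int_0^T 1{X_t < x} dt on a uniform grid."""
    return float(np.mean(path[:-1] < x))

path = simulate_ou()
F_hat_0 = empirical_df(path, 0.0)
F_hat_1 = empirical_df(path, 1.0)
```

For this model the invariant law is $\mathcal N(0, \sigma^2/(2\beta))$,
so for large $T$ the two values should be close to $0.5$ and to the
normal distribution function evaluated at $\sqrt 2$.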



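Recall that these estimators are consistent, asymptotically normal and
asymptotically efficient under the basic hypothesis (see [H]). The
proof of these properties is based on the representations

$$ \sqrt T\bigl(\hat F_T(x) - F(x)\bigr) = \frac{2}{\sqrt T}\int_0^T \frac{F(X_t)F(x) - F(X_t\wedge x)}{\sigma(X_t)\,f(X_t)}\,dW_t - \frac{2}{\sqrt T}\int_{X_0}^{X_T} \frac{F(y)F(x) - F(y\wedge x)}{\sigma(y)^2 f(y)}\,dy \tag{2} $$

and

$$ \sqrt T\bigl(\hat f_T(x) - f(x)\bigr) = \frac{2f(x)}{\sqrt T}\int_0^T \frac{F(X_t) - \mathbb 1_{\{X_t > x\}}}{\sigma(X_t)\,f(X_t)}\,dW_t + \frac{2f(x)}{\sqrt T}\int_{X_0}^{X_T} \frac{\mathbb 1_{\{y > x\}} - F(y)}{\sigma(y)^2 f(y)}\,dy. \tag{3} $$

Using these representations and the central limit theorem for
stochastic integrals we obtain the limits in distribution

$$ \sqrt T\bigl(\hat F_T(x) - F(x)\bigr) \Longrightarrow 2\int_{-\infty}^{\infty} \frac{F(y)F(x) - F(y\wedge x)}{\sigma(y)\sqrt{f(y)}}\,dW(y), $$

$$ \sqrt T\bigl(\hat f_T(x) - f(x)\bigr) \Longrightarrow 2f(x)\int_{-\infty}^{\infty} \frac{F(y) - \mathbb 1_{\{y > x\}}}{\sigma(y)\sqrt{f(y)}}\,dW(y), $$

where $W(\cdot)$ is a two-sided Wiener process.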

The estimator $\hat f_T(x)$ is the a.s. derivative of $\hat F_T(x)$.
Indeed, using the equality (see [17])

$$ \int_0^T h(X_t)\,dt = \int_{-\infty}^{\infty} h(y)\,\frac{\Lambda_T(y)}{\sigma(y)^2}\,dy $$

we can write

$$ \hat F_T(x) = \int_{-\infty}^{\infty} \mathbb 1_{\{y < x\}}\,\hat f_T(y)\,dy = \int_{-\infty}^{x} \hat f_T(y)\,dy. $$

As the local time is continuous with probability one we have the limit

$$ \lim_{a\to 0} \frac{\hat F_T(x+a) - \hat F_T(x)}{a} = \lim_{a\to 0} \frac1a \int_x^{x+a} \hat f_T(y)\,dy = \hat f_T(x). $$

Therefore we can call the local time estimator the empirical density.
It is easy to see that the representation (3) can be obtained from (2)
by differentiation.






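Introduce the class $\mathcal P$ of locally bounded functions with
polynomial majorants ($p > 0$)

$$ \mathcal P = \bigl\{h(\cdot):\ |h(y)| \le C\,(1 + |y|^p)\bigr\} $$

and the following condition:

$\mathcal A_0$. The functions $S(\cdot), \sigma(\cdot)^{\pm 1} \in \mathcal P$ and

$$ \limsup_{|y|\to\infty} \mathrm{sgn}(y)\,\frac{S(y)}{\sigma(y)^2} < 0. $$

Note that if $S(\cdot)$ and $\sigma(\cdot)$ satisfy $\mathcal A_0$ then
the condition $\mathcal{RP}$ is fulfilled. Moreover, under condition
$\mathcal A_0$ for any $p > 0$ there exist $\kappa > 0$ and $C > 0$
such that

$$ \mathbf E\,\bigl|\sqrt T\bigl(\hat f_T(x) - f(x)\bigr)\bigr|^p \le C\,e^{-\kappa |x|}. $$

For the proof see Proposition 1.11, [11].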



3 Asymptotically Parameter Free Tests 

The first problem is the following. We observe an ergodic diffusion
process $X^T = (X_t,\ 0 \le t \le T)$, which solves the equation

$$ dX_t = S(X_t)\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T \tag{4} $$

and we have to test the composite basic hypothesis

$\mathcal H_0$: this process admits the stochastic differential

$$ dX_t = -\beta\,\mathrm{sgn}(X_t-\alpha)\,|X_t-\alpha|^{\gamma}\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T, \tag{5} $$

where $\vartheta = (\alpha,\beta)$ is the unknown parameter,
$\vartheta \in \Theta = (a_1,a_2)\times(b_1,b_2)$, $b_1 > 0$,

against the nonparametric alternative

$\mathcal H_1$: the observed process does not belong to this parametric
family.

The parameters $\gamma \ge 0$ and $\sigma > 0$ are assumed to be known.
Note that if $\gamma = 1$, then we obtain the Ornstein-Uhlenbeck process

$$ dX_t = -\beta\,(X_t - \alpha)\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

and if $\gamma = 0$ then the solution of (5) is the simple switching
process

$$ dX_t = -\beta\,\mathrm{sgn}(X_t - \alpha)\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

studied in [11], Section 3.4. For $\gamma = 3$ we have the cubic trend

$$ dX_t = -\beta\,(X_t - \alpha)^3\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T. $$



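It is easy to verify that for $\beta > 0$, $\gamma \ge 0$ this process
is positive recurrent with the invariant density

$$ f(\vartheta,x) = \frac{1}{G(\vartheta)} \exp\Bigl\{-\frac{2\beta}{(\gamma+1)\,\sigma^2}\,|x-\alpha|^{\gamma+1}\Bigr\}. \tag{6} $$

The normalizing constant is

$$ G(\vartheta) = \int_{-\infty}^{\infty} \exp\Bigl\{-\frac{2\beta\,|x-\alpha|^{\gamma+1}}{(\gamma+1)\,\sigma^2}\Bigr\}\,dx = \frac{2}{\gamma+1}\,\Gamma\Bigl(\frac{1}{\gamma+1}\Bigr) \Bigl(\frac{(\gamma+1)\,\sigma^2}{2\beta}\Bigr)^{\frac{1}{\gamma+1}}, $$

where $\Gamma(\cdot)$ is the Gamma function. Below we denote by
$f_0(x) = f(\vartheta_0,x)$ and $F_0(x) = F(\vartheta_0,x)$ the density
and the distribution function corresponding to the values
$\vartheta_0 = (0,1)$, $\sigma = 1$ and we denote by $\xi$ the random
variable with such distribution function.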
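As a numerical sanity check of the density of this family, the sketch
below evaluates $f(\vartheta,x) \propto \exp\{-2\beta|x-\alpha|^{\gamma+1}/((\gamma+1)\sigma^2)\}$
with a normalizing constant that we computed ourselves from the
Gamma-function integral (an assumption of this illustration, as are
the function name `invariant_density` and the parameter values). It
verifies that the density integrates to one for several $\gamma$ and
that $f(\vartheta,x) = \kappa f_0(\kappa(x-\alpha))$ with
$\kappa = \beta^{1/(\gamma+1)}\sigma^{-2/(\gamma+1)}$.

```python
import math
import numpy as np

def invariant_density(x, alpha, beta, gamma, sigma):
    """Density proportional to exp(-2 beta |x - alpha|^(gamma+1) / ((gamma+1) sigma^2))."""
    p = gamma + 1.0
    c = 2.0 * beta / (p * sigma ** 2)
    # int exp(-c |u|^p) du = 2 Gamma(1/p) / (p c^(1/p))
    norm = 2.0 * math.gamma(1.0 / p) / (p * c ** (1.0 / p))
    return np.exp(-c * np.abs(np.asarray(x, dtype=float) - alpha) ** p) / norm

# total mass by a Riemann sum, for several values of gamma
grid = np.linspace(-30.0, 30.0, 600001)
h = grid[1] - grid[0]
masses = {g: float(np.sum(invariant_density(grid, 0.5, 2.0, g, 1.5)) * h)
          for g in (0.0, 0.25, 1.0, 3.0)}

# scaling relation f(theta, x) = kappa * f_0(kappa * (x - alpha))
alpha, beta, gamma, sigma = 0.5, 2.0, 3.0, 1.5
kappa = beta ** (1.0 / (gamma + 1.0)) * sigma ** (-2.0 / (gamma + 1.0))
xs = np.linspace(-2.0, 3.0, 11)
lhs = invariant_density(xs, alpha, beta, gamma, sigma)
rhs = kappa * invariant_density(kappa * (xs - alpha), 0.0, 1.0, gamma, 1.0)
```

The scaling relation is what allows all limit distributions in this
section to be expressed through the single density $f_0$.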

It will be convenient to study the cases $\gamma \ge 1$ (including the
O-U process) and $0 \le \gamma < \frac12$ separately because the rates
of convergence of the MLE $\hat\vartheta_T$ in these two cases are
essentially different. We defer the discussion of the complementary
case $\gamma \in [\frac12, 1)$ to Section 3.3 below.

3.1 Case $\gamma \ge 1$.

Let us consider the following ergodic diffusion process as the basic
model (under hypothesis $\mathcal H_0$)

$$ dX_t = -\beta\,\mathrm{sgn}(X_t-\alpha)\,|X_t-\alpha|^{\gamma}\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

where $\vartheta = (\alpha,\beta) \in \Theta = (a_1,a_2)\times(b_1,b_2)$,
$b_1 > 0$ and $\gamma \ge 1$.

Recall that the MLE $\hat\vartheta_T = (\hat\alpha_T, \hat\beta_T)$ of
the parameter $\vartheta$ is consistent and asymptotically normal.
Moreover, the moments of this estimator converge too (see Theorem 2.8,
[11]):

$$ \sup_{\vartheta\in\mathbb K}\, T^{p/2}\,\mathbf E_\vartheta \bigl|\hat\vartheta_T - \vartheta\bigr|^p \le C \tag{7} $$

for any $p > 0$ (here $\mathbb K$ is a compact subset of $\Theta$).

3.1.1 The Test Based on Empirical Distribution Function

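We study the test

$$ \psi_T(X^T) = \mathbb 1_{\{\Delta_T(X^T) > c_\varepsilon\}}, $$

where the test statistic is

$$ \Delta_T(X^T) = \hat\beta_T^{\frac{2}{\gamma+1}}\,\sigma^{\frac{2(\gamma-1)}{\gamma+1}}\; T \int_{-\infty}^{\infty} \bigl[\hat F_T(x) - F(\hat\vartheta_T,x)\bigr]^2\,dF(\hat\vartheta_T,x). $$

Let us introduce the random variable

$$ \Delta = \int_{-\infty}^{\infty} \Bigl[\Phi(y) + \frac{\upsilon}{\gamma\,a}\,f_0(y) + \frac{\varsigma\,y}{(\gamma+1)\,b}\,f_0(y)\Bigr]^2 f_0(y)\,dy, $$

where

$$ \Phi(y) = 2\int_{-\infty}^{\infty} \frac{F_0(z)F_0(y) - F_0(z\wedge y)}{\sqrt{f_0(z)}}\,dW(z), $$

$$ \upsilon = \int_{-\infty}^{\infty} |z|^{\gamma-1}\sqrt{f_0(z)}\,dW(z), \qquad a = \mathbf E_0|\xi|^{2\gamma-2}, $$

$$ \varsigma = \int_{-\infty}^{\infty} \mathrm{sgn}(z)\,|z|^{\gamma}\sqrt{f_0(z)}\,dW(z), \qquad b = \mathbf E_0|\xi|^{2\gamma}. $$

Here $W(\cdot)$ is a two-sided Wiener process. The constant
$c_\varepsilon$ is defined by the equation

$$ \mathbf P(\Delta > c_\varepsilon) = \varepsilon. $$

The distribution of the random variable $\Delta$ is not known in a
closed form but the value $c_\varepsilon$ can be easily obtained with
the help of Monte Carlo simulations. Let us stress that this value is
the same for all $\vartheta \in \Theta$ and therefore can be calculated
before the experiment.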
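A Monte Carlo computation of the threshold can be sketched as follows.
This is a rough illustration, not the authors' procedure. It is
restricted to $\gamma = 1$, where $f_0$ is the $\mathcal N(0,\frac12)$
density, $a = 1$ and $b = \frac12$ (values we computed for this special
case), and it assumes that the limit variable has the form
$\int[\Phi(y) + \frac{\upsilon}{\gamma a}f_0(y) + \frac{\varsigma y}{(\gamma+1)b}f_0(y)]^2 f_0(y)\,dy$
with Wiener integrals as in the surrounding text. The truncation
$[-L, L]$, the grid size and the function name are our own choices.

```python
import math
import numpy as np

def simulate_delta_gamma1(n_rep=2000, L=5.0, m=401, seed=0):
    """Monte Carlo draws of the limit variable for gamma = 1 on a grid of [-L, L].

    Returns (delta, raw): `delta` includes the two terms generated by the
    estimated parameters, `raw` keeps only Phi (no parameter estimation).
    """
    rng = np.random.default_rng(seed)
    z = np.linspace(-L, L, m)
    dz = z[1] - z[0]
    f0 = np.exp(-z ** 2) / math.sqrt(math.pi)               # density of N(0, 1/2)
    F0 = np.array([0.5 * (1.0 + math.erf(v)) for v in z])   # its distribution function
    a, b, gamma = 1.0, 0.5, 1.0                             # a = E|xi|^0, b = E xi^2
    # kernel[i, j] = 2 (F0(z_j) F0(y_i) - F0(min(z_j, y_i))) / sqrt(f0(z_j))
    kernel = 2.0 * (np.outer(F0, F0) - np.minimum.outer(F0, F0)) / np.sqrt(f0)
    dW = rng.normal(0.0, math.sqrt(dz), size=(n_rep, m))    # Wiener increments on the grid
    Phi = dW @ kernel.T
    ups = dW @ (np.abs(z) ** (gamma - 1.0) * np.sqrt(f0))
    vsig = dW @ (np.sign(z) * np.abs(z) ** gamma * np.sqrt(f0))
    corrected = (Phi + np.outer(ups, f0) / (gamma * a)
                 + np.outer(vsig, z * f0) / ((gamma + 1.0) * b))
    delta = np.sum(corrected ** 2 * f0, axis=1) * dz
    raw = np.sum(Phi ** 2 * f0, axis=1) * dz
    return delta, raw

delta, raw = simulate_delta_gamma1()
eps = 0.05
c_eps = float(np.quantile(delta, 1.0 - eps))   # empirical (1 - eps)-quantile
```

The array `raw` is returned only for comparison: estimating the
parameters reduces the variance of the limit process, so the average of
`delta` should be smaller than the average of `raw`.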
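Our first result is

Theorem 1 The test $\psi_T(X^T) \in \mathcal K_\varepsilon$.

Proof. We have the relation

$$ \sqrt T\bigl(\hat F_T(x) - F(\hat\vartheta_T,x)\bigr) = \sqrt T\bigl(\hat F_T(x) - F(\vartheta,x)\bigr) + \sqrt T\bigl(F(\vartheta,x) - F(\hat\vartheta_T,x)\bigr) $$

$$ = \eta_T(x) - \Bigl\langle \sqrt T(\hat\vartheta_T - \vartheta),\ \frac{\partial F(\vartheta,x)}{\partial\vartheta}\Bigr\rangle + r_T. $$

Here $\langle\cdot,\cdot\rangle$ is the usual scalar product in
$\mathbb R^2$ and the stochastic process $\eta_T(x)$ is given by

$$ \eta_T(x) = \frac{2}{\sqrt T}\int_0^T \frac{F(\vartheta,X_t)F(\vartheta,x) - F(\vartheta,X_t\wedge x)}{\sigma\,f(\vartheta,X_t)}\,dW_t. $$

The convergence $r_T \to 0$ follows from the representation (2), the
estimate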



sup 



^ F{^,y)F{^,x) -F{^,yAx) 

my) 



dy] <C, 



which can be obtained by direct calculation (see [11], Theorem 4.6) and
the estimate (7). Note that the density $f(\vartheta,x)$ has
exponentially decreasing tails and all necessary estimates can be
derived in a straightforward way.
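Define the random functions

$$ \hat\eta_T(x) = \eta_T(x) - \Bigl\langle \sqrt T(\hat\vartheta_T - \vartheta),\ \frac{\partial F(\vartheta,x)}{\partial\vartheta}\Bigr\rangle, \qquad x \in \mathbb R, $$

$$ \hat\eta_0(y) = \Phi(y) + \frac{\upsilon}{\gamma\,a}\,f_0(y) + \frac{\varsigma\,y}{(\gamma+1)\,b}\,f_0(y), \qquad y \in \mathbb R, $$

with $\Phi$, $\upsilon$, $\varsigma$, $a$ and $b$ introduced above. We
have to verify that

$$ \hat\beta_T^{\frac{2}{\gamma+1}}\sigma^{\frac{2(\gamma-1)}{\gamma+1}} \int_{-\infty}^{\infty} \hat\eta_T(x)^2\, f(\hat\vartheta_T,x)\,dx \Longrightarrow \int_{-\infty}^{\infty} \hat\eta_0(y)^2 f_0(y)\,dy. \tag{9} $$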



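We start with the convergence of the finite dimensional distributions.
The form of the invariant density (6) suggests the following change of
variables

$$ Z_t = \beta^{\frac{1}{\gamma+1}}\sigma^{-\frac{2}{\gamma+1}}\,(X_t - \alpha). $$

This process satisfies the following stochastic differential

$$ dZ_t = -\mathrm{sgn}(Z_t)\,|Z_t|^{\gamma}\,d\bigl(t\,\beta^{\frac{2}{\gamma+1}}\sigma^{\frac{2(\gamma-1)}{\gamma+1}}\bigr) + \beta^{\frac{1}{\gamma+1}}\sigma^{\frac{\gamma-1}{\gamma+1}}\,dW_t, \qquad 0 \le t \le T. $$

Therefore, if we denote

$$ Y_s = Z_{s\,\beta^{-\frac{2}{\gamma+1}}\sigma^{-\frac{2(\gamma-1)}{\gamma+1}}}, \qquad T_* = T\,\beta^{\frac{2}{\gamma+1}}\sigma^{\frac{2(\gamma-1)}{\gamma+1}}, $$

the process $Y_s$ satisfies the equation

$$ dY_s = -\mathrm{sgn}(Y_s)\,|Y_s|^{\gamma}\,ds + dw_s, \qquad Y_0, \quad 0 \le s \le T_*, $$

where $w_s = \beta^{\frac{1}{\gamma+1}}\sigma^{\frac{\gamma-1}{\gamma+1}}\,W_t$
(here $t = s\,\beta^{-\frac{2}{\gamma+1}}\sigma^{-\frac{2(\gamma-1)}{\gamma+1}}$)
is another Wiener process. Obviously the process $Y_s$ is ergodic with
the invariant density $f_0(x)$.

Let us define $y = \beta^{\frac{1}{\gamma+1}}\sigma^{-\frac{2}{\gamma+1}}(x - \alpha)$.
Then we can write

$$ F(\vartheta,x) = F_0(y), \qquad f(\vartheta,x) = \beta^{\frac{1}{\gamma+1}}\sigma^{-\frac{2}{\gamma+1}}\,f_0(y). $$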



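For the stochastic process $\eta_T(x)$ this change of variables gives
the representation

$$ \eta_T(x) = \frac{2}{\sqrt T}\int_0^T \frac{F(\vartheta,X_t)F(\vartheta,x) - F(\vartheta,x\wedge X_t)}{\beta^{\frac{1}{\gamma+1}}\sigma^{\frac{\gamma-1}{\gamma+1}}\,f_0(Z_t)}\,dW_t $$

$$ = \beta^{-\frac{1}{\gamma+1}}\sigma^{\frac{1-\gamma}{\gamma+1}}\;\frac{2}{\sqrt{T_*}}\int_0^{T_*} \frac{F_0(Y_s)F_0(y) - F_0(y\wedge Y_s)}{f_0(Y_s)}\,dw_s = \beta^{-\frac{1}{\gamma+1}}\sigma^{\frac{1-\gamma}{\gamma+1}}\,\Phi_{T_*}(y), $$

where the last equality defines the random function $\Phi_{T_*}(y)$.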
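Introduce the following integrals

$$ \upsilon_{T_*} = \frac{1}{\sqrt{T_*}}\int_0^{T_*} |Y_s|^{\gamma-1}\,dw_s, \qquad a_{T_*} = \frac{1}{T_*}\int_0^{T_*} |Y_s|^{2\gamma-2}\,ds, $$

$$ \varsigma_{T_*} = \frac{1}{\sqrt{T_*}}\int_0^{T_*} \mathrm{sgn}(Y_s)\,|Y_s|^{\gamma}\,dw_s, \qquad b_{T_*} = \frac{1}{T_*}\int_0^{T_*} |Y_s|^{2\gamma}\,ds, $$

$$ c_{T_*} = \frac{1}{T_*}\int_0^{T_*} \mathrm{sgn}(Y_s)\,|Y_s|^{2\gamma-1}\,ds. $$

Note that by the law of large numbers we have

$$ a_{T_*} \to a, \qquad b_{T_*} \to b. $$

The invariant density $f_0(y)$ is a symmetric function and therefore

$$ c_{T_*} \to \mathbf E_{\vartheta_0}\bigl(\mathrm{sgn}(\xi)\,|\xi|^{2\gamma-1}\bigr) = 0. \tag{10} $$

The random variables $\upsilon_{T_*}$ and $\varsigma_{T_*}$ are
asymptotically normal by the central limit theorem

$$ \upsilon_{T_*} \Longrightarrow \upsilon \sim \mathcal N(0,a), \qquad \varsigma_{T_*} \Longrightarrow \varsigma \sim \mathcal N(0,b), $$

and due to (10) they are asymptotically independent.

The MLE $\hat\vartheta_T$ admits the following representation

$$ \sqrt T(\hat\vartheta_T - \vartheta) = \mathrm I_T(\vartheta)^{-1}\,\frac{1}{\sigma\sqrt T}\int_0^T \frac{\partial S(\vartheta,X_t)}{\partial\vartheta}\,dW_t + o(1), \tag{11} $$

where $\mathrm I_T(\vartheta)$ is the $2\times 2$ matrix

$$ \mathrm I_T(\vartheta) = \frac{1}{T\sigma^2}\int_0^T \frac{\partial S(\vartheta,X_t)}{\partial\vartheta} \Bigl(\frac{\partial S(\vartheta,X_t)}{\partial\vartheta}\Bigr)^{\tau}\,dt. $$

Here $\tau$ means transposition. For the proof of this representation
see Theorem 2.8 in [11] and Theorem 8.1 in [9]. The convergence (10)
allows us to consider the information matrix as asymptotically
diagonal.

Note that for the trend coefficient
$S(\vartheta,x) = -\beta\,\mathrm{sgn}(x-\alpha)\,|x-\alpha|^{\gamma}$ we
have the equality
$\frac{\partial S(\vartheta,x)}{\partial\alpha} = \beta\gamma\,|x-\alpha|^{\gamma-1}$.
We have

$$ \frac{1}{T\sigma^2}\int_0^T \Bigl(\frac{\partial S(\vartheta,X_t)}{\partial\alpha}\Bigr)^2 dt = \gamma^2\,\beta^{\frac{4}{\gamma+1}}\sigma^{\frac{2(\gamma-3)}{\gamma+1}}\,a_{T_*}, $$

$$ \frac{1}{T\sigma^2}\int_0^T \Bigl(\frac{\partial S(\vartheta,X_t)}{\partial\beta}\Bigr)^2 dt = \beta^{-\frac{2\gamma}{\gamma+1}}\sigma^{\frac{2(\gamma-1)}{\gamma+1}}\,b_{T_*} $$

and

$$ \frac{1}{T\sigma^2}\int_0^T \frac{\partial S(\vartheta,X_t)}{\partial\alpha}\,\frac{\partial S(\vartheta,X_t)}{\partial\beta}\,dt = -\gamma\,\beta^{\frac{2-\gamma}{\gamma+1}}\sigma^{\frac{2(\gamma-2)}{\gamma+1}}\,c_{T_*} \to 0. $$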

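Further

$$ \frac{\partial F(\vartheta,x)}{\partial\alpha} = -\beta^{\frac{1}{\gamma+1}}\sigma^{-\frac{2}{\gamma+1}}\,f_0(y), \qquad \frac{\partial F(\vartheta,x)}{\partial\beta} = \frac{y}{\beta\,(\gamma+1)}\,f_0(y). $$

Therefore we can write

$$ \Bigl\langle \sqrt T(\hat\vartheta_T - \vartheta),\ \frac{\partial F(\vartheta,x)}{\partial\vartheta}\Bigr\rangle = \sqrt T(\hat\alpha_T - \alpha)\,\frac{\partial F(\vartheta,x)}{\partial\alpha} + \sqrt T(\hat\beta_T - \beta)\,\frac{\partial F(\vartheta,x)}{\partial\beta} $$

$$ = -\beta^{-\frac{1}{\gamma+1}}\sigma^{\frac{1-\gamma}{\gamma+1}} \Bigl[\frac{\upsilon_{T_*}}{\gamma\,a_{T_*}} + \frac{\varsigma_{T_*}\,y}{(\gamma+1)\,b_{T_*}}\Bigr] f_0(y) + o(1). $$

This, in turn, allows us to write

$$ \sqrt T\bigl(\hat F_T(x) - F(\hat\vartheta_T,x)\bigr) = \beta^{-\frac{1}{\gamma+1}}\sigma^{\frac{1-\gamma}{\gamma+1}} \Bigl[\Phi_{T_*}(y) + \frac{\upsilon_{T_*}}{\gamma\,a_{T_*}}\,f_0(y) + \frac{\varsigma_{T_*}\,y}{(\gamma+1)\,b_{T_*}}\,f_0(y)\Bigr] + o(1). $$

Finally we obtain

$$ \Delta_T(X^T) = \int_{-\infty}^{\infty} \Bigl[\Phi_{T_*}(y) + \frac{\upsilon_{T_*}}{\gamma\,a_{T_*}}\,f_0(y) + \frac{\varsigma_{T_*}\,y}{(\gamma+1)\,b_{T_*}}\,f_0(y)\Bigr]^2 f_0(y)\,dy + o(1). $$

Now we can replace $a_{T_*}$ and $b_{T_*}$ with their limits $a$ and
$b$ and denote

$$ \tilde\eta_{T_*}(y) = \Phi_{T_*}(y) + \frac{\upsilon_{T_*}}{\gamma\,a}\,f_0(y) + \frac{\varsigma_{T_*}\,y}{(\gamma+1)\,b}\,f_0(y). $$

We shall verify the convergence

$$ \int_{-\infty}^{\infty} \tilde\eta_{T_*}(y)^2 f_0(y)\,dy \Longrightarrow \int_{-\infty}^{\infty} \hat\eta_0(y)^2 f_0(y)\,dy. \tag{12} $$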

To prove it, we shall check the following three conditions.

1. The finite dimensional distributions of $\tilde\eta_{T_*}(\cdot)$
converge, i.e., for any $k \ge 1$ and any $y_1,\ldots,y_k$ we have the
convergence

$$ \bigl(\tilde\eta_{T_*}(y_1),\ldots,\tilde\eta_{T_*}(y_k)\bigr) \Longrightarrow \bigl(\hat\eta_0(y_1),\ldots,\hat\eta_0(y_k)\bigr). $$

2. There exists a constant $C_1 > 0$ such that



Ei»o \r]T, (2/2) - m, {yi)? < Ci I2/2 - 2/1 1 
3. There exist constants $C_2 > 0$ and $\kappa > 0$ such that



(13) 



(14) 



If these conditions hold, (12) follows from the results of [5]. Indeed,
by Theorem A.22 in [H] the integrals converge on any finite interval
$[-L, L]$ and outside of this interval we can estimate the tail
integrals as in the proof of Theorem 1.5.6 in [9].

By the central limit theorem for stochastic integrals we obtain the
desired joint asymptotic normality

$$ \bigl(\Phi_{T_*}(y_1),\ldots,\Phi_{T_*}(y_k),\ \upsilon_{T_*},\ \varsigma_{T_*}\bigr) \Longrightarrow \bigl(\Phi(y_1),\ldots,\Phi(y_k),\ \upsilon,\ \varsigma\bigr), $$

which proves the convergence of the finite dimensional distributions of
$\tilde\eta_{T_*}(\cdot)$. Further, for all $y_1, y_2$ with
$|y_2 - y_1| \le 1$ we have the estimate ($y_1 < y_2$)

E^o I^T (2/2) - f]T (2/1)1^ < 3E^„ |<I>T. (2/2) - (2/1)1^ 

+ 7 TT27; 1^2/0 (2/2) - 2/1/0 (2/1)1^ E^„V^2^ 

(7 + 1) ¥ 

+ ^ 1/0 (2/2) - /o (2/1)1' E^„4 < C I2/2 - 2/1 1' 

because 

\2 rn ^ \ n / m2 



F icb r ^ cb r 4 r ^o(^) [^0(2/2) -i^o(2/ir , 

E^o |$Ta2/2) - (2/1)1 =4/ d^ 

J~oo Jo [z) 

^ 4 r i^o(^)'[^o(2/2)-Fo(2/i)]' ^^. 

/o (2) 

^4 (Fq (Z) [Fq (j/2) - Fq (j/i)] - Fq (^) + Fq (j/i))^ ^_ 

<C\y2-y,\\ 
Therefore, we obtain (13). To prove (14) we write



E^oT/T (2/)' < 3E^„$T, (2/)' + ^2/Vo (2/)' E^„vl/^, + 4^/0 {yf E.^U^ 

(7 + 1) If a 

< 12 r [^oMF.fa) F,(.A.)l' ^^ + C (1 + , (.)^ ^ 

Jo l^^j 

Further, using the same arguments as in [11], Example 4.1.3, we obtain the 
estimate 

J-00 Jo Jo [z) 

with some constants C > 0, k > 0. For example, for the large values of z we 
can write 

l2 / ;.oo . „ . \ 2 



[Fq jz) - 1]- 
fo{z) 



(^j^ exp I -cv?+^ + I du^ 



<{^j^ exp|-|u^+^|dn^ <Ce 



-C2 ' 
and so on.

Therefore the conditions of the weak convergence of integrals are
verified and the test $\psi_T \in \mathcal K_\varepsilon$.

The consistency of this test is implied by elementary inequalities as
follows. Suppose that the trend coefficient $S(x)$ of the observed
process (4) does not belong to the given parametric family
$S(\vartheta,x)$, $\vartheta \in [a_1,a_2]\times[b_1,b_2]$. It is known
(Proposition 2.36, [11]) that the MLE $\hat\vartheta_T$ converges to the
value $\vartheta_*$ which minimizes the Kullback-Leibler distance
between the parametric family and the true distribution:

$$ \vartheta_* = \arg\inf_{\vartheta\in\Theta}\ \mathbf E_S\bigl[S(\vartheta,\xi) - S(\xi)\bigr]^2. $$

Here the random variable $\xi$ has the invariant density function
$f_S(x)$. It can be shown that

$$ \int_{-\infty}^{\infty} \bigl[\hat F_T(x) - F(\hat\vartheta_T,x)\bigr]^2 dF(\hat\vartheta_T,x) \longrightarrow \int_{-\infty}^{\infty} \bigl[F_S(x) - F(\vartheta_*,x)\bigr]^2 dF(\vartheta_*,x) > 0. $$

Therefore for the statistic $\Delta_T$ we have

$$ \Delta_T \to \infty \qquad \text{and} \qquad \mathbf P_S\{\Delta_T > c_\varepsilon\} \to 1. $$

Hence the test is consistent.



3.1.2 The Test Based on Empirical Density 

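Now we study the test

$$ \hat\psi_T(X^T) = \mathbb 1_{\{\delta_T(X^T) > c_\varepsilon\}}, $$

where the test statistic is

$$ \delta_T(X^T) = \sigma^2\,T \int_{-\infty}^{\infty} \bigl[\hat f_T(x) - f(\hat\vartheta_T,x)\bigr]^2\,dF(\hat\vartheta_T,x). $$

Let us denote $\zeta_T(\vartheta,x) = \sqrt T\bigl(\hat f_T(x) - f(\vartheta,x)\bigr)$
and write

$$ \sqrt T\bigl(\hat f_T(x) - f(\hat\vartheta_T,x)\bigr) = \zeta_T(\vartheta,x) - \sqrt T(\hat\alpha_T - \alpha)\,\frac{\partial f(\vartheta,x)}{\partial\alpha} - \sqrt T(\hat\beta_T - \beta)\,\frac{\partial f(\vartheta,x)}{\partial\beta} + o(1). $$

Using the same arguments as above we obtain the representations

$$ \zeta_T(\vartheta,x) = \frac{2 f_0(y)}{\sigma\sqrt{T_*}}\int_0^{T_*} \frac{F_0(Y_s) - \mathbb 1_{\{Y_s > y\}}}{f_0(Y_s)}\,dw_s + o(1) = \sigma^{-1}\,\tilde\Phi_{T_*}(y)\,f_0(y) + o(1), $$

$$ \frac{\partial f(\vartheta,x)}{\partial\alpha} = 2\,\beta^{\frac{2}{\gamma+1}}\sigma^{-\frac{4}{\gamma+1}}\,\mathrm{sgn}(y)\,|y|^{\gamma} f_0(y), $$

$$ \frac{\partial f(\vartheta,x)}{\partial\beta} = \frac{\beta^{-\frac{\gamma}{\gamma+1}}\sigma^{-\frac{2}{\gamma+1}}}{\gamma+1}\,\bigl[1 - 2|y|^{\gamma+1}\bigr] f_0(y). $$

These equalities together with the representations of the estimators
$\hat\alpha_T$ and $\hat\beta_T$ allow us to write

$$ \sigma\sqrt T\bigl(\hat f_T(x) - f(\hat\vartheta_T,x)\bigr) = \Bigl[\tilde\Phi_{T_*}(y) - 2\,\mathrm{sgn}(y)\,|y|^{\gamma}\,\frac{\upsilon_{T_*}}{\gamma\,a_{T_*}} + \bigl[1 - 2|y|^{\gamma+1}\bigr]\frac{\varsigma_{T_*}}{(\gamma+1)\,b_{T_*}}\Bigr] f_0(y) + o(1). $$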


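Using the same arguments as in Section 3.1.1 the convergence

$$ \delta_T(X^T) \Longrightarrow \delta = \int_{-\infty}^{\infty} \zeta_0(y)^2 f_0(y)\,dy $$

can be proved. Here

$$ \zeta_0(y) = \Bigl[\tilde\Phi(y) - 2\,\mathrm{sgn}(y)\,|y|^{\gamma}\,\frac{\upsilon}{\gamma\,a} + \bigl[1 - 2|y|^{\gamma+1}\bigr]\frac{\varsigma}{(\gamma+1)\,b}\Bigr] f_0(y), $$

$$ \tilde\Phi(y) = 2\int_{-\infty}^{\infty} \frac{F_0(z) - \mathbb 1_{\{z > y\}}}{\sqrt{f_0(z)}}\,dW(z), $$

with $\upsilon$, $\varsigma$, $a$ and $b$ as in Section 3.1.1.

Hence the test $\hat\psi_T = \mathbb 1_{\{\delta_T > c_\varepsilon\}}$
is APF. The threshold $c_\varepsilon$ is defined by the equation

$$ \mathbf P\{\delta > c_\varepsilon\} = \varepsilon $$

and therefore the test belongs to $\mathcal K_\varepsilon$.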
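3.2 Case $0 \le \gamma < \frac12$

If we observe (under hypothesis $\mathcal H_0$) the same equation

$$ dX_t = -\beta\,\mathrm{sgn}(X_t-\alpha)\,|X_t-\alpha|^{\gamma}\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

but with $\gamma \in [0,\frac12)$, then the main difference with the
case $\gamma \ge 1$ is due to the rate of convergence of the MLE
$\hat\alpha_T$. As the rate is faster than $\sqrt T$ the contribution of
this estimator to the limit distribution of the test statistic is
negligible. This property of the test statistics was mentioned by
Darling.

Recall that in the case $\gamma = 0$ (simple switching)

$$ dX_t = -\beta\,\mathrm{sgn}(X_t - \alpha)\,dt + \sigma\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

we have the convergence

$$ T(\hat\alpha_T - \alpha) \Longrightarrow \hat u, \qquad Z(\hat u) = \sup_u Z(u), $$

where

$$ Z(u) = \exp\Bigl\{W(u\,\Gamma_\vartheta) - \frac{|u|\,\Gamma_\vartheta}{2}\Bigr\}. $$

Here $W(\cdot)$ is a double-sided Wiener process and
$\Gamma_\vartheta > 0$ is some constant (see details in [11], Section
3.4). Moreover, we have the convergence of moments too: for any $p > 0$

$$ \mathbf E_\vartheta\,|T(\hat\alpha_T - \alpha)|^p \longrightarrow \mathbf E_\vartheta\,|\hat u|^p. $$

Therefore, if we repeat the proofs above, we shall see that

$$ \sqrt T(\hat\alpha_T - \alpha)\,\frac{\partial F(\vartheta,x)}{\partial\alpha} = O\Bigl(\frac{1}{\sqrt T}\Bigr) $$

and

$$ \sqrt T(\hat\alpha_T - \alpha)\,\frac{\partial f(\vartheta,x)}{\partial\alpha} = O\Bigl(\frac{1}{\sqrt T}\Bigr). $$

Of course, we have to be careful with the second term because the
invariant density is

$$ f(\vartheta,x) = \frac{\beta}{\sigma^2}\exp\Bigl\{-\frac{2\beta}{\sigma^2}\,|x - \alpha|\Bigr\} $$

and its derivative is not continuous. However, the function
$f(\vartheta,x)$ is absolutely continuous and this is sufficient for
the proof.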
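The Cramér-von Mises type statistics are

$$ \Delta_T(X^T) = \frac{\hat\beta_T^2}{\sigma^2}\,T\int_{-\infty}^{\infty} \bigl[\hat F_T(x) - F(\hat\vartheta_T,x)\bigr]^2 dF(\hat\vartheta_T,x), \qquad \delta_T(X^T) = \sigma^2\,T\int_{-\infty}^{\infty} \bigl[\hat f_T(x) - f(\hat\vartheta_T,x)\bigr]^2 dF(\hat\vartheta_T,x), $$

and their limits are

$$ \Delta_T(X^T) \Longrightarrow \int_{-\infty}^{\infty} \Bigl[\Phi(y) + y\,f_0(y)\,\frac{\varsigma}{(\gamma+1)\,b}\Bigr]^2 f_0(y)\,dy, $$

$$ \delta_T(X^T) \Longrightarrow \int_{-\infty}^{\infty} \Bigl[\tilde\Phi(y) + \bigl[1 - 2|y|\bigr]\frac{\varsigma}{(\gamma+1)\,b}\Bigr]^2 f_0(y)^3\,dy, $$

where $\Phi$, $\tilde\Phi$, $\varsigma$ and $b$ are defined as in
Section 3.1. Hence the corresponding tests belong to
$\mathcal K_\varepsilon$.

If $\gamma \in (0,\frac12)$, then we have a different limit

$$ T^{\frac{1}{2\gamma+1}}(\hat\alpha_T - \alpha) \Longrightarrow \hat u, \qquad Z(\hat u) = \sup_u Z(u) $$

with $Z(u) = \exp\bigl\{W^H(u\,\Gamma_\vartheta) - \frac12 |u\,\Gamma_\vartheta|^{2H}\bigr\}$.
Here $W^H(\cdot)$ is a two-sided fractional Brownian motion,
$H = \gamma + \frac12$ is the Hurst parameter and $\Gamma_\vartheta$ is
some constant. We have the convergence of moments too: for any $p > 0$

$$ \mathbf E_\vartheta\,\bigl|T^{\frac{1}{2\gamma+1}}(\hat\alpha_T - \alpha)\bigr|^p \longrightarrow \mathbf E_\vartheta\,|\hat u|^p. $$

For the proofs see [2] or [11], Section 3.2.

Hence once again we can use the tests $\psi_T$ and $\hat\psi_T$, and the
limits of the test statistics are obtained from $\Delta$ and $\delta$
by dropping the term generated by the estimator $\hat\alpha_T$ (the one
with the factor $1/(\gamma a)$).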

3.3 Discussion 



The case $\gamma \in [\frac12, 1)$ was not included in this study
because the appropriate properties of the MLE $\hat\alpha_T$ are
available only in the cases $\gamma \in [0,\frac12)$ [2] and
$\gamma \ge 1$ [11]. In the case $\gamma \in (\frac12, 1)$ the
derivative of the trend with respect to the parameter $\alpha$ is no
longer locally bounded, but the singularity at the point $x = \alpha$
is integrable and the proof presented in [11], Theorem 2.8 can be
carried out. Note that the Fisher information is bounded. Therefore in
this case we obtain the same result as for $\gamma \ge 1$. This is not
the case if $\gamma = \frac12$ and for this model we need a special
study. Note that the rate of convergence of the MLE is better than
$\sqrt T$ and the limit distribution of the test statistic has to be
the same as for $\gamma \in (0,\frac12)$.

Note that our proofs (strengthened up to the weak convergence in
$\mathcal C_0(\mathbb R)$) imply the following limits for the
Kolmogorov-Smirnov type statistics

$$ \hat\Delta_T(X^T) = \hat\beta_T^{\frac{1}{\gamma+1}}\sigma^{\frac{\gamma-1}{\gamma+1}}\,\sup_x \sqrt T\,\bigl|\hat F_T(x) - F(\hat\vartheta_T,x)\bigr| \Longrightarrow \sup_y |\hat\eta_0(y)|, $$

$$ \hat\delta_T(X^T) = \sigma\,\sup_x \sqrt T\,\bigl|\hat f_T(x) - f(\hat\vartheta_T,x)\bigr| \Longrightarrow \sup_y |\zeta_0(y)|, $$

where $\hat\eta_0$ and $\zeta_0$ are the limit processes of Sections
3.1.1 and 3.1.2 and the limit distributions do not depend on
$\vartheta$. Hence the goodness of fit tests based on these statistics
are APF. It will be interesting to consider other models with APF
tests.

4 Asymptotically Distribution Free Tests



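Now we consider the second statement of the goodness of fit hypotheses
testing problem. The basic hypothesis $\mathcal H_0$ is simple: the
observed diffusion process satisfies the stochastic differential
equation

$$ dX_t = S_0(X_t)\,dt + \sigma(X_t)\,dW_t, \qquad X_0, \quad 0 \le t \le T, $$

where $S_0(\cdot)$ and $\sigma(\cdot)$ are known functions. We assume
that the conditions $\mathcal{ES}$ and $\mathcal A_0$ hold. Therefore,
the stochastic process is ergodic with the invariant density

$$ f_{S_0}(x) = \frac{1}{G(S_0)\,\sigma(x)^2}\exp\Bigl\{2\int_0^x \frac{S_0(y)}{\sigma(y)^2}\,dy\Bigr\}. $$

Of course, we can use the Cramér-von Mises type statistic based on the
empirical density

$$ \delta_T(X^T) = T\int_{-\infty}^{\infty} \bigl[\hat f_T(x) - f_{S_0}(x)\bigr]^2 dF_{S_0}(x), $$

but its limit under the hypothesis is

$$ \delta(S_0) = \int_{-\infty}^{\infty} \zeta(S_0,x)^2\,dF_{S_0}(x), $$

where

$$ \zeta(S_0,x) = 2 f_{S_0}(x)\int_{-\infty}^{\infty} \frac{F_{S_0}(y) - \mathbb 1_{\{y > x\}}}{\sigma(y)\sqrt{f_{S_0}(y)}}\,dW(y) $$

is the limit in distribution of the normalized difference

$$ \zeta_T(S_0,x) = \sqrt T\bigl(\hat f_T(x) - f_{S_0}(x)\bigr) $$

(see the representation (3)).

Therefore, for the test
$\phi_T(X^T) = \mathbb 1_{\{\delta_T(X^T) > c_\varepsilon\}}$ we have to
solve the equation

$$ \mathbf P_{S_0}\bigl(\delta(S_0) > c_\varepsilon\bigr) = \varepsilon $$

and we see that the threshold $c_\varepsilon = c_\varepsilon(S_0)$
depends on the model.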

Recall that for i.i.d. observations the limit of the corresponding
statistic based on the empirical distribution function is

$$ \Delta_n = n\int_{-\infty}^{\infty} \bigl[\hat F_n(x) - F_0(x)\bigr]^2 dF_0(x) \Longrightarrow \Delta = \int_0^1 B(t)^2\,dt, $$

where $B(t)$, $0 \le t \le 1$ is the Brownian bridge. Hence the test
$\phi_n = \mathbb 1_{\{\Delta_n > c_\varepsilon\}}$ with $c_\varepsilon$
from the equation $\mathbf P(\Delta > c_\varepsilon) = \varepsilon$
belongs to the class $\mathcal K_\varepsilon$. Tests based on
statistics with limit distributions independent of the model under the
hypothesis are called asymptotically distribution free (ADF). There are
several works devoted to the construction of ADF tests for ergodic
diffusion processes observed in continuous time. We can mention here
[Z],[5], [TTj.[5]. [IS]; [12], but the connection of these tests with
the classical Cramér-von Mises and Kolmogorov-Smirnov tests is not
evident. One exception is the work [12], where a linear transformation
of the normalized deviations $\zeta_T(\cdot)$ and $\eta_T(\cdot)$
allowed the construction of ADF tests. Unfortunately, the proof in [12]
is not satisfactory, since it used a property of the time change in the
Wiener integral which is not always true. That is why we decided to
suggest another linear transformation which leads to an ADF test.

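Therefore, our goal is to find a linear transformation $L[\zeta_T]$ of
the random function $\zeta_T(x)$ such that

$$ \delta_T = \int_{-\infty}^{\infty} L[\zeta_T](x)^2\,dF_{S_0}(x) \Longrightarrow \int_0^1 w(t)^2\,dt, $$

where $w(t)$, $0 \le t \le 1$ is a Wiener process. Then obviously the
test $\hat\phi_T = \mathbb 1_{\{\delta_T > c_\varepsilon\}}$ with the
threshold $c_\varepsilon$ defined by

$$ \mathbf P\Bigl(\int_0^1 w(t)^2\,dt > c_\varepsilon\Bigr) = \varepsilon $$

will be ADF.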
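Since the limit $\int_0^1 w(t)^2\,dt$ does not involve the model, the
threshold can be tabulated once. A minimal Monte Carlo sketch follows.
The number of paths, the time step, the level $\varepsilon = 0.05$ and
the function name `wiener_square_integral` are our own illustrative
choices.

```python
import numpy as np

def wiener_square_integral(n_paths=4000, n_steps=500, seed=0):
    """Monte Carlo draws of int_0^1 w(t)^2 dt for a standard Wiener process."""
    rng = np.random.default_rng(seed)
    dt = 1.0 / n_steps
    increments = rng.normal(0.0, np.sqrt(dt), size=(n_paths, n_steps))
    w = np.cumsum(increments, axis=1)            # w(t_k), t_k = k * dt, k = 1..n_steps
    w = np.hstack([np.zeros((n_paths, 1)), w])   # prepend w(0) = 0
    sq = w ** 2
    # trapezoidal rule on each path
    return dt * (sq[:, 1:-1].sum(axis=1) + 0.5 * (sq[:, 0] + sq[:, -1]))

samples = wiener_square_integral()
eps = 0.05
c_eps = float(np.quantile(samples, 1.0 - eps))   # empirical (1 - eps)-quantile
```

The sample mean of `samples` should be close to
$\mathbf E\int_0^1 w(t)^2\,dt = \frac12$, which gives a quick check of
the discretization.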

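Let us rewrite the stochastic integral as follows

$$ \frac{\zeta(S_0,x)}{2 f_{S_0}(x)} = \int_{-\infty}^{\infty} \frac{F_{S_0}(y) - \mathbb 1_{\{y > x\}}}{\sigma(y)\sqrt{f_{S_0}(y)}}\,dW(y) = \int_0^1 \frac{s - \mathbb 1_{\{s > t\}}}{a(s)\,b(s)}\,dw(s) = u(t), $$

where the last equality defines $u(t)$, $w(s)$, $0 \le s \le 1$ is a
Wiener process and we denoted

$$ F_{S_0}(y) = s, \qquad F_{S_0}(x) = t, \qquad a(s) = \sigma\bigl(F_{S_0}^{-1}(s)\bigr), \qquad b(s) = f_{S_0}\bigl(F_{S_0}^{-1}(s)\bigr). $$

Here $F_{S_0}^{-1}(s)$ is the function inverse to $F_{S_0}(y)$, i.e.,
the solution $y$ of the equation $F_{S_0}(y) = s$. Note that under our
assumptions the function $F_{S_0}(y)$ is strictly increasing.

We can write

$$ du(t) = \frac{dw(t)}{a(t)\,b(t)}. $$

Hence the integral (understood in the mean square sense)

$$ \int_0^t a(s)\,b(s)\,du(s) = w(t) $$

provides us the desired transformation. Indeed, we have

$$ u(t) = \frac{\zeta\bigl(S_0, F_{S_0}^{-1}(t)\bigr)}{2 f_{S_0}\bigl(F_{S_0}^{-1}(t)\bigr)} $$

and

$$ \int_0^t a(s)\,b(s)\,du(s) = \int_{-\infty}^{x} \sigma(y)\,f_{S_0}(y)\,d\Bigl[\frac{\zeta(S_0,y)}{2 f_{S_0}(y)}\Bigr] = w\bigl(F_{S_0}(x)\bigr) = w(t). $$

Therefore, we can write

$$ \int_{-\infty}^{\infty} \Bigl(\int_{-\infty}^{x} \sigma(y)\,f_{S_0}(y)\,d\Bigl[\frac{\zeta(S_0,y)}{2 f_{S_0}(y)}\Bigr]\Bigr)^2 dF_{S_0}(x) = \int_0^1 w(t)^2\,dt. $$

This equality suggests the statistic

$$ \delta_T(X^T) = \int_{-\infty}^{\infty} \Bigl(\int_{-\infty}^{x} \sigma(y)\,f_{S_0}(y)\,d\Bigl[\frac{\zeta_T(S_0,y)}{2 f_{S_0}(y)}\Bigr]\Bigr)^2 dF_{S_0}(x), $$

where we have to define the integral with respect to the normalized
empirical density

$$ \int h(x)\,d\Bigl[\frac{\zeta_T(x)}{2 f_{S_0}(x)}\Bigr]. $$

If we verify the convergence $\delta_T(X^T) \Longrightarrow \delta$,
then the corresponding test

$$ \hat\phi_T(X^T) = \mathbb 1_{\{\delta_T(X^T) > c_\varepsilon\}}, \qquad \mathbf P\{\delta > c_\varepsilon\} = \varepsilon $$

will be ADF.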

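Using the representation (3), for any piecewise continuous function
$h(x)$ with bounded support and a partition
$a = x_0 < x_1 < \ldots < x_m = b$ we can write

$$ \frac{\zeta_T(x_{i+1})}{2 f_{S_0}(x_{i+1})} - \frac{\zeta_T(x_i)}{2 f_{S_0}(x_i)} = \frac{1}{\sqrt T}\int_0^T \frac{\mathbb 1_{\{x_i < X_t \le x_{i+1}\}}}{\sigma(X_t)\,f_{S_0}(X_t)}\,dW_t - \frac{1}{\sqrt T}\int_{X_0}^{X_T} \frac{\mathbb 1_{\{x_i < y \le x_{i+1}\}}}{\sigma(y)^2 f_{S_0}(y)}\,dy. $$

Therefore as $\max_i |x_{i+1} - x_i| \to 0$ we obtain the limit

$$ \lim \sum_i h(x_i)\Bigl[\frac{\zeta_T(x_{i+1})}{2 f_{S_0}(x_{i+1})} - \frac{\zeta_T(x_i)}{2 f_{S_0}(x_i)}\Bigr] = \frac{1}{\sqrt T}\int_0^T \frac{h(X_t)\,\mathbb 1_{\{a < X_t < b\}}}{\sigma(X_t)\,f_{S_0}(X_t)}\,dW_t - \frac{1}{\sqrt T}\int_{X_0}^{X_T} \frac{h(y)\,\mathbb 1_{\{a < y < b\}}}{\sigma(y)^2 f_{S_0}(y)}\,dy. $$

If $a = -\infty$ (as in our case) then this limit exists for a class of
functions vanishing at $-\infty$ because
$\sigma(y)\,f_{S_0}(y) \to 0$ as $|y| \to \infty$.

Recall that in our case $h(x) = \sigma(x)\,f_{S_0}(x)$ and the integral
is

$$ \int_{-\infty}^{x} \sigma(y)\,f_{S_0}(y)\,d\Bigl[\frac{\zeta_T(y)}{2 f_{S_0}(y)}\Bigr] = \frac{1}{\sqrt T}\int_0^T \mathbb 1_{\{X_t < x\}}\,dW_t - \frac{1}{\sqrt T}\int_{X_0}^{X_T} \frac{\mathbb 1_{\{y < x\}}}{\sigma(y)}\,dy. $$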


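Therefore,

$$ \delta_T(X^T) = \int_{-\infty}^{\infty} \Bigl(\frac{1}{\sqrt T}\int_0^T \mathbb 1_{\{X_t < x\}}\,dW_t - \frac{H(x,X_0,X_T)}{\sqrt T}\Bigr)^2 dF_{S_0}(x) = \int_{-\infty}^{\infty} \Bigl(\frac{1}{\sqrt T}\int_0^T \mathbb 1_{\{X_t < x\}}\,dW_t\Bigr)^2 dF_{S_0}(x) + o(1), $$

where we put

$$ H(x,y,z) = \int_y^z \frac{\mathbb 1_{\{v < x\}}}{\sigma(v)}\,dv. $$

Note that the estimate $\mathbf E_{S_0} H(x,X_0,X_T)^2 \le C$ follows
directly from the assumption $\mathcal A_0$ (see [11] for details).

By the law of large numbers

$$ \frac1T\int_0^T \mathbb 1_{\{X_t < x\}}\,dt \longrightarrow F_{S_0}(x). $$

Hence, by the central limit theorem the stochastic integral

$$ I_T(x) = \frac{1}{\sqrt T}\int_0^T \mathbb 1_{\{X_t < x\}}\,dW_t $$

converges:

$$ \bigl(I_T(x_1),\ldots,I_T(x_k)\bigr) \Longrightarrow \bigl(I(x_1),\ldots,I(x_k)\bigr), \tag{15} $$

where

$$ I(x) = w\bigl(F_{S_0}(x)\bigr). $$

Further, ($x_1 < x_2$)

$$ \mathbf E_{S_0}\,|I_T(x_2) - I_T(x_1)|^2 = F_{S_0}(x_2) - F_{S_0}(x_1) \le C\,|x_2 - x_1| \tag{16} $$

because under condition $\mathcal A_0$ the density $f_{S_0}(\cdot)$ is a
bounded function. The properties (15) and (16) yield the convergence

$$ \int_{-\infty}^{\infty} [I_T(x)]^2\,dF_{S_0}(x) = \int_0^1 \bigl[I_T\bigl(F_{S_0}^{-1}(t)\bigr)\bigr]^2 dt \Longrightarrow \int_0^1 w(t)^2\,dt $$

(Theorem A22, [5]). When we know the form of the statistic $\delta_T$ we
can construct another goodness of fit test with the same asymptotic
properties as follows. Let us introduce the statistic

$$ \hat\delta_T = \int_{-\infty}^{\infty} \Bigl(\frac{1}{\sqrt T}\int_0^T \frac{\mathbb 1_{\{X_t < x\}}}{\sigma(X_t)}\,\bigl[dX_t - S_0(X_t)\,dt\bigr]\Bigr)^2 dF_{S_0}(x) $$

and the constant $c_\varepsilon$:

$$ \mathbf P\Bigl(\int_0^1 w(t)^2\,dt > c_\varepsilon\Bigr) = \varepsilon. $$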
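On discrete observations the statistic above can be approximated by
replacing the stochastic integral with a sum over increments. The
sketch below is a rough illustration under our own assumptions: an
Euler-simulated path, constant $\sigma = 1$, the hypothesized trend
$S_0(x) = -x$ (so that $F_{S_0}$ is the $\mathcal N(0,\frac12)$ law),
and the hypothetical helper names `euler_path` and `adf_statistic`.
Sorting the observations lets us evaluate the inner integral for all
$x$ on a grid with one cumulative sum.

```python
import math
import numpy as np

def euler_path(drift, sigma, x0, T, dt, seed):
    """Euler scheme for dX = drift(X) dt + sigma dW on [0, T]."""
    rng = np.random.default_rng(seed)
    n = int(round(T / dt))
    dw = rng.normal(0.0, math.sqrt(dt), size=n)
    x = np.empty(n + 1)
    x[0] = x0
    for k in range(n):
        x[k + 1] = x[k] + drift(x[k]) * dt + sigma * dw[k]
    return x

def adf_statistic(path, dt, drift0, sigma, grid, weights):
    """Discretized int ( T^{-1/2} int_0^T 1{X_t < x} [dX_t - S_0(X_t) dt] / sigma )^2 dF_{S_0}(x)."""
    T = dt * (len(path) - 1)
    x_left = path[:-1]
    resid = (np.diff(path) - drift0(x_left) * dt) / sigma
    order = np.argsort(x_left)
    xs = x_left[order]
    csum = np.concatenate([[0.0], np.cumsum(resid[order])])
    idx = np.searchsorted(xs, grid, side="left")   # number of X_t strictly below x
    inner = csum[idx] / math.sqrt(T)
    return float(np.sum(inner ** 2 * weights))

S0 = lambda x: -x            # hypothesized trend (illustrative choice)
S_alt = lambda x: -3.0 * x   # a trend outside the hypothesis
sigma, T, dt = 1.0, 1000.0, 0.01
grid = np.linspace(-4.0, 4.0, 801)
# weights approximate dF_{S_0}(x) = f_{S_0}(x) dx with f_{S_0} the N(0, 1/2) density
weights = np.exp(-grid ** 2) / math.sqrt(math.pi) * (grid[1] - grid[0])

stat_null = adf_statistic(euler_path(S0, sigma, 0.0, T, dt, seed=1), dt, S0, sigma, grid, weights)
stat_alt = adf_statistic(euler_path(S_alt, sigma, 0.0, T, dt, seed=2), dt, S0, sigma, grid, weights)
```

Under the hypothesis the value behaves like a draw of
$\int_0^1 w(t)^2\,dt$, while under the alternative trend it grows
linearly in $T$, which is the mechanism behind the consistency of the
test.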

We assume that under the nonparametric alternative $\mathcal H_1$

the function $S(\cdot)$ satisfies the conditions $\mathcal{ES}$ and
$\mathcal A_0$. Here $\xi$ is the random variable with the density
$f_S(x)$. Therefore the observed process is ergodic with the invariant
density $f_S(\cdot)$.

Then for the test
$\hat\phi_T = \mathbb 1_{\{\hat\delta_T > c_\varepsilon\}}$ we have the
following result.

Proposition 1 The test $\hat\phi_T \in \mathcal K_\varepsilon$ and is
consistent against any fixed alternative $\mathcal H_1$.

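Proof. Under hypothesis $\mathcal H_0$ we have

$$ \hat\delta_T = \int_{-\infty}^{\infty} \Bigl(\frac{1}{\sqrt T}\int_0^T \mathbb 1_{\{X_t < x\}}\,dW_t\Bigr)^2 dF_{S_0}(x) $$

and it converges to the following limit

$$ \hat\delta_T \Longrightarrow \int_0^1 w(t)^2\,dt. $$

Therefore, the test
$\hat\phi_T = \mathbb 1_{\{\hat\delta_T > c_\varepsilon\}}$ belongs to
$\mathcal K_\varepsilon$.

The consistency follows from standard arguments as follows. Under the
alternative $\mathcal H_1$ we can write

$$ \hat\delta_T = \int_{-\infty}^{\infty} \bigl(I_T(x) + \sqrt T\,M_T(x)\bigr)^2 dF_{S_0}(x). $$

The first (stochastic) integral is asymptotically normal

$$ I_T(x) = \frac{1}{\sqrt T}\int_0^T \mathbb 1_{\{X_t < x\}}\,dW_t \Longrightarrow w\bigl(F_S(x)\bigr) \sim \mathcal N\bigl(0, F_S(x)\bigr) $$

and for the second we have by the law of large numbers

$$ M_T(x) = \frac1T\int_0^T \frac{\mathbb 1_{\{X_t < x\}}}{\sigma(X_t)}\,\bigl[S(X_t) - S_0(X_t)\bigr]\,dt \longrightarrow M(x). $$

Here

$$ M(x) = \mathbf E_S\Bigl(\frac{\mathbb 1_{\{\xi < x\}}}{\sigma(\xi)}\,\bigl[S(\xi) - S_0(\xi)\bigr]\Bigr). $$

Denote

$$ \|h(\cdot)\|_0^2 = \int_{-\infty}^{\infty} h(x)^2\,dF_{S_0}(x). $$

Then we can write

$$ \mathbf P_S\{\hat\delta_T > c_\varepsilon\} = \mathbf P_S\bigl(\|I_T(\cdot) + \sqrt T\,M_T(\cdot)\|_0 > \sqrt{c_\varepsilon}\bigr) \ge \mathbf P_S\bigl(\sqrt T\,\|M_T(\cdot)\|_0 - \|I_T(\cdot)\|_0 > \sqrt{c_\varepsilon}\bigr). $$

Therefore, if $\|M_T(\cdot)\|_0 \to \|M(\cdot)\|_0 > 0$ then the test is
consistent. The condition $\|M(\cdot)\|_0 = 0$ implies the equality

$$ \int_{-\infty}^{x} \frac{S(y) - S_0(y)}{\sigma(y)}\,f_S(y)\,dy = 0 \qquad \text{for all } x \in \mathbb R. $$

Hence

$$ \frac{S(x) - S_0(x)}{\sigma(x)}\,f_S(x) = 0 \qquad \text{for almost all } x \in \mathbb R, $$

which is equivalent to the equality $S(x) = S_0(x)$ for almost all $x$.
This contradicts the definition of the alternative.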

We do not consider here the ADF test based on the empirical
distribution function, because differentiating $\eta_T(x)$ reduces the
problem to the test based on $\zeta_T(x)$, for which we have already
suggested a solution.

Remark. Note that the central statistic in (17) coincides with the
statistic used in the Kolmogorov-Smirnov type test studied in [12]. It
is interesting to note that another Kolmogorov-Smirnov test based on
the empirical density
$\zeta_T(x) = \sqrt T\bigl(\hat f_T(x) - f_{S_0}(x)\bigr)$ was studied
in [11]. To compare these two tests we put $\sigma(x) = 1$. Then the
test in [13] is
$\hat\psi_T(X^T) = \mathbb 1_{\{\delta_T > c_\varepsilon\}}$ with

$$ \delta_T = \sup_x \frac{1}{\sqrt T}\,\Bigl|\int_0^T \mathbb 1_{\{X_t < x\}}\,dX_t - \int_0^T \mathbb 1_{\{X_t < x\}}\,S_0(X_t)\,dt\Bigr|. \tag{18} $$

It is shown that under the hypothesis

$$ \delta_T \Longrightarrow \sup_{0 \le t \le 1} |w(t)|. $$

Therefore this test is ADF.
The test proposed in [11] is



$$ \delta_T^{\circ} = \sup_x |\zeta_T(x)|, \qquad \psi_T^{\circ}(X^T) = \mathbb 1_{\{\delta_T^{\circ} > c_\varepsilon\}}. $$

It is shown that this statistic is asymptotically equivalent to the statistic



St = sup 



^{Xt<x}dXt — / lL^Xt<x}So (Xt) dt 



(19) 



and it converges to the limit 

S+ =^ sup|$(x)| fo{x) . 



The comparison of (18) and (19) shows the advantage of (18) because it
is ADF.